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Example: Vacuum Cleaner Agent 


♦ agent: robot vacuum cleaner 

♦ environment: floors of your apartment 

♦ sensors: 

■ dirt sensor: detects when floor in front of robot is dirty 

■ bump sensor: detects when it has bumped into something 

■ power sensor: measures amount of power in battery 

■ bag sensor: amount of space remaining in dirt bag 

♦ effectors: 

■ motorized wheels 

■ suction motor 

■ plug into wall? empty dirt bag? 

♦ percepts: "Floor is dirty” 

♦ actions: "Forward, 0.5 ft/sec” 
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< Rational Agent 

o 

o 

b 

o ♦ Performance Measure: Criteria for 
§ determining the quality of an agent’s 

V behavior 

.S' ■ Example: dirt collected in 8 hour shift 

| ♦ Avoiding Omniscience 

3 .An omniscient agent is one that can predict 
| the future perfectly. We don't want this! 

= ♦ Agent: Mapping from percept sequences 

& to actions 

6 
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Defn: Ideal Rational Agent 


♦ For each percept sequence, choose the 
action that maximizes the expected value 
of the performance measure given only 
builtin knowledge and the percept 
sequence 


(c) 2003 Thomas G. Dietterich and 

Devika Subramanian 23 


1 







Oregon State University - CS430 Intro to AI Oregon State University - CS430 Intro to AI 

















Oregon State University - CS430 Intro to AI Oregon State University - CS430 Intro to AI 


Avoiding Tables (2) 


Summarizing Percepts as 
Environment Model 


♦ Summarizing the Percept Sequence 

• By analyzing the sequence, we can 
compute a model of the current state of the 
world. With this state, the agent can act as 
if the world is a Markov environment 



Policy 
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Pseudo-Code 



Goal-Based Agents 


♦ Generate possible sequences of actions 

♦ Predict resulting states 

♦ Assess goals in each resulting state 

♦ Choose an action that will achieve the goal 

♦ We can reprogram the agent simply by changing the goals 
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Goal-Based Agents compute the 
desired action on demand 


Example of Computing Table 
Dynamically 


♦ In many cases, the agent can compute 
the desired action rather than looking it 
up. This trades extra CPU time to 
reduce memory. 

• Example: Deep Blue 
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Problems with Computing Table 
Dynamically 

♦ Search space may be exponentially large 

■ Computing the best action may be computationally 
intractable 

♦ World may change while we are searching 

■ In a dynamic environment, we must act promptly 

♦ Knowledge of the world may be incomplete or 
wrong 

■ We may not be able to accurately predict the future 


< Utility-Based Agents 

o 

o _ 

C 

^ ♦In some applications, we need to make quantitative 
comparisons of states based on utilities. Important 
u when there are tradeoffs. 
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PEAS Descriptions 


♦ P: Performance Measure 

♦ E: Environment 

♦ A: Actuators 

♦ S: Sensors 



Different Kinds of Environments 


Examples of Environments 


Fully-observable vs. Partially-observable 

■ Fully-observable = Markov 

Deterministic vs. Stochastic 

■ Strategic: deterministic except for the actions of 
other agents 

Episodic vs. Sequential 
Static vs. Dynamic 
Discrete vs. Continuous 
Single agent vs. Multiagent 


Crossword 

puzzle _ 

Chess w/clock | 
Poker 

Backgammon 
Taxi driving 
Medical Dx 
Image analy 
Part-picking 
Refinery contr 
English tutor 


Deterministic 


Deterministic 


Strategic 


Strategic 


Stochastic 


Stochastic 


Stochastic 


Deterministic 


Stochastic 


Stochastic 


Stochastic 


Episodic Static Discrete Agents? 


Sequential Static Discrete Single 
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Advantages of Simpler 
Environments 

♦ Observable: policy can be based on only most 
recent percept 

♦ Deterministic: predicting effects of actions is 
easier 

♦ Episodic: Do not need to look ahead beyond 
end of episode 

♦ Static: Can afford lots of time to make 
decisions 

♦ Discrete: Reasoning is simpler 
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